feat: add TTL-based cleanup strategy for DatabaseTriggers collection #464

spa-raj · 2025-10-05T10:58:53Z

Implements automatic cleanup of completed/failed triggers using MongoDB TTL index to prevent unbounded growth of the DatabaseTriggers collection.

Changes:

Added expires_at field to DatabaseTriggers model for TTL tracking
Created MongoDB TTL index on expires_at with expireAfterSeconds=0
Added TRIGGER_RETENTION_DAYS setting (default: 30 days, configurable via env)
Updated mark_as_triggered() to set expiration time on completed triggers
Updated mark_as_failed() to set expiration time on failed triggers
PENDING/TRIGGERING triggers remain without expiration (never cleaned up)

Implementation:

MongoDB automatically deletes documents when expires_at timestamp is reached
TTL runs in background every 60 seconds (MongoDB default)
Retention period configurable via TRIGGER_RETENTION_DAYS environment variable
Only terminal states (TRIGGERED, FAILED) are marked for cleanup

Tests:

Added comprehensive unit tests for TTL expiration logic
Tests verify expires_at is set correctly for both TRIGGERED and FAILED states
Tests verify custom retention periods are respected
All 4 new tests passing

Also bumped python-sdk version to 0.0.3b2

Resolves #433

Implements automatic cleanup of completed/failed triggers using MongoDB TTL index to prevent unbounded growth of the DatabaseTriggers collection. Changes: - Added expires_at field to DatabaseTriggers model for TTL tracking - Created MongoDB TTL index on expires_at with expireAfterSeconds=0 - Added TRIGGER_RETENTION_DAYS setting (default: 30 days, configurable via env) - Updated mark_as_triggered() to set expiration time on completed triggers - Updated mark_as_failed() to set expiration time on failed triggers - PENDING/TRIGGERING triggers remain without expiration (never cleaned up) Implementation: - MongoDB automatically deletes documents when expires_at timestamp is reached - TTL runs in background every 60 seconds (MongoDB default) - Retention period configurable via TRIGGER_RETENTION_DAYS environment variable - Only terminal states (TRIGGERED, FAILED) are marked for cleanup Tests: - Added comprehensive unit tests for TTL expiration logic - Tests verify expires_at is set correctly for both TRIGGERED and FAILED states - Tests verify custom retention periods are respected - All 4 new tests passing Also bumped python-sdk version to 0.0.3b2 Resolves exospherehost#433

safedep · 2025-10-05T10:58:57Z

SafeDep Report Summary

Package Details

Package	Malware	Vulnerability	Risky License	Report

_{This report is generated by SafeDep Github App}

gemini-code-assist · 2025-10-05T10:59:07Z

Summary of Changes

Hello @spa-raj, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces a crucial maintenance feature to automatically clean up old and completed or failed database triggers. By leveraging MongoDB's native TTL indexing, the system will now prevent the DatabaseTriggers collection from growing indefinitely, improving database performance and resource management. The retention period for these triggers is fully configurable, providing flexibility for different operational needs.

Highlights

Automatic Trigger Cleanup: Implemented an automatic cleanup strategy for completed and failed DatabaseTriggers using MongoDB's Time-To-Live (TTL) feature to prevent unbounded collection growth.
New expires_at Field: Added an expires_at field to the DatabaseTriggers model to track when a trigger document should be automatically removed by MongoDB.
Configurable Retention Period: Introduced a TRIGGER_RETENTION_DAYS setting (defaulting to 30 days) to allow configuration of how long completed/failed triggers are retained before cleanup.
TTL Index Creation: A MongoDB TTL index is now created on the expires_at field, configured to delete documents immediately once their expiration time is reached.
Updated Trigger Status Functions: The mark_as_triggered() and mark_as_failed() functions now set the expires_at timestamp for triggers, ensuring only terminal states are marked for cleanup.
Comprehensive Unit Tests: Added new unit tests to verify the correct setting of expires_at for both TRIGGERED and FAILED states, and to confirm that custom retention periods are respected.
Python SDK Version Bump: The python-sdk version has been updated from 0.0.3b1 to 0.0.3b2.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

coderabbitai · 2025-10-05T10:59:23Z

Note

Other AI code review bot(s) detected

CodeRabbit has detected other AI code review bot(s) in this pull request and will avoid duplicating their findings in the review comments. This may lead to a less comprehensive review.

📝 Walkthrough

Summary by CodeRabbit

New Features
- Added configurable trigger retention policy; completed and failed triggers now automatically expire and are cleaned up based on a configurable retention window (default: 720 hours).
Documentation
- Added TRIGGER_RETENTION_HOURS environment variable documentation for configuring trigger retention period.

Walkthrough

Adds a configurable trigger retention setting and timezone-aware expires_at to triggers, creates a TTL index for terminal triggers, threads retention through trigger creation and cron flow, runs startup cleanup to remove non-expiring terminal triggers, updates main to run init tasks, and adds tests and docs.

Changes

Cohort / File(s)	Summary of changes
Configuration: retention setting `state-manager/app/config/settings.py`	Adds `Settings.trigger_retention_hours: int` and reads `TRIGGER_RETENTION_HOURS` in `Settings.from_env`.
Database model & TTL index `state-manager/app/models/db/trigger.py`	Adds `DatabaseTriggers.expires_at: Optional[datetime]` and a TTL `IndexModel` `ttl_expires_at` on `expires_at` with `expireAfterSeconds=0` and a `partialFilterExpression` for `trigger_status` in (TRIGGERED, FAILED).
Trigger cron flow updates `state-manager/app/tasks/trigger_cron.py`	Propagates `retention_hours` through `handle_trigger`, `mark_as_triggered`, `mark_as_failed`, and `create_next_triggers`; computes and persists timezone-aware `expires_at = now_utc + retention_hours`; `trigger_cron` reads settings once and supplies `trigger_retention_hours` to workers.
Cron trigger creation `state-manager/app/tasks/verify_graph.py`	Reads module-level settings and sets `expires_at` for created cron triggers using `settings.trigger_retention_hours`.
Startup cleanup tasks & lifecycle hook `state-manager/app/tasks/init_tasks.py`, `state-manager/app/main.py`	Adds `delete_old_triggers()` to remove terminal triggers lacking `expires_at`, `init_tasks()` to run it at startup, and `main.py` invokes `await init_tasks()` during lifespan startup.
Unit tests for TTL and flow `state-manager/tests/unit/tasks/test_trigger_cron.py`, `state-manager/tests/unit/test_main.py`	Adds async tests covering mark_as_triggered/failed, due-trigger querying, call_trigger_graph, create_next_triggers (DuplicateKeyError handling), handle_trigger success/failure, and trigger_cron orchestration; updates main tests to mock `init_tasks`.
Docs and misc `docs/docs/exosphere/state-manager-setup.md`, `.github/.codespellignorewords`	Documentation updated to document `TRIGGER_WORKERS` and `TRIGGER_RETENTION_HOURS`; `.codespellignorewords` adds `nin`.

Sequence Diagram(s)

sequenceDiagram
  autonumber
  participant CR as trigger_cron
  participant CFG as Settings
  participant HT as handle_trigger
  participant DB as DatabaseTriggers
  participant CTRL as trigger_graph

  CR->>CFG: read trigger_retention_hours
  loop worker
    CR->>HT: handle_trigger(cron_time, retention_hours)
    HT->>DB: get_due_triggers(cron_time) -> set TRIGGERING
    alt due trigger found
      HT->>CTRL: call_trigger_graph(namespace, graph, body, request_id)
      alt success
        HT->>DB: mark_as_triggered(..., expires_at = now_utc + retention_hours)
        HT->>DB: create_next_triggers(..., expires_at = next_time + retention_hours)
      else error
        HT->>DB: mark_as_failed(..., expires_at = now_utc + retention_hours)
        HT->>DB: create_next_triggers(..., expires_at = next_time + retention_hours)
      end
    else no due triggers
      HT-->>CR: return
    end
  end

  Note over DB: TTL index `ttl_expires_at` on `expires_at` (expireAfterSeconds=0)\napplies only when trigger_status in (TRIGGERED, FAILED)

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Possibly related PRs

Adding triggers to graphs #428 — Original trigger subsystem; this PR extends it by adding expires_at, TTL index, retention setting, and cleanup tasks.
Final changes for trigger beta 0.0.3b1 #435 — Modifies trigger processing/cron logic touching similar files and flows; likely overlaps in trigger lifecycle changes.

Suggested reviewers

nk-ag

Poem

I’m a tidy rabbit, I hop the night through,
I stamp an expires_at with a hop and a chew.
Old triggers tumble when their hours are done,
Cron hums, I tidy—one by one. 🥕⌛️

Pre-merge checks and finishing touches

✅ Passed checks (5 passed)

Check name	Status	Explanation
Title Check	✅ Passed	The pull request title "feat: add TTL-based cleanup strategy for DatabaseTriggers collection" clearly and concisely summarizes the main objective of the changeset. It directly corresponds to the primary implementation goal of adding automatic cleanup using MongoDB TTL indexes to prevent unbounded growth of the DatabaseTriggers collection. The title uses clear, specific language without vague terms or unnecessary noise, making it immediately understandable to team members reviewing the commit history.
Linked Issues Check	✅ Passed	The code changes comprehensively implement the requirements from issue #433. The PR adds a TTL-based cleanup strategy via: an expires_at field to the DatabaseTriggers model, a MongoDB TTL index with expireAfterSeconds=0, a configurable TRIGGER_RETENTION_HOURS setting (default 720 hours/30 days), updated mark_as_triggered() and mark_as_failed() methods that set expiration times, and unit tests validating the TTL logic. The implementation follows the proposed TTL field solution from the issue, prevents unbounded collection growth, restricts cleanup to terminal states (TRIGGERED, FAILED), and leaves PENDING/TRIGGERING triggers unaffected as designed.
Out of Scope Changes Check	✅ Passed	The vast majority of changes are directly in scope for issue #433's TTL-based cleanup strategy. Core changes include: settings configuration, DatabaseTriggers model updates, trigger_cron task logic, init_tasks cleanup functions, documentation, and comprehensive tests. A minor addition to .github/.codespellignorewords (adding "nin" to ignore words) appears to be necessary for CI/linting compliance rather than a functional code change. One potential discrepancy exists: the PR description mentions "bumped python-sdk version to 0.0.3b2," but this is not reflected in the provided raw_summary, making it unclear whether this change exists or was accurately captured.
Description Check	✅ Passed	The pull request description is related to the changeset and describes the main features implemented. It mentions the expires_at field, MongoDB TTL index, retention settings, updated trigger status methods, unit tests, and issue resolution. However, there is a discrepancy: the description states "TRIGGER_RETENTION_DAYS setting (default: 30 days)" but the raw_summary shows the actual implementation uses "TRIGGER_RETENTION_HOURS" with default 720 (which equals 30 days). Despite this naming inconsistency, the description is substantively related to the actual changes in the codebase.
Docstring Coverage	✅ Passed	Docstring coverage is 93.33% which is sufficient. The required threshold is 80.00%.

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment

📜 Recent review details

Configuration used: CodeRabbit UI

Review profile: ASSERTIVE

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 0f7098a and bbcb7b0.

📒 Files selected for processing (4)

docs/docs/exosphere/state-manager-setup.md (5 hunks)
state-manager/app/config/settings.py (2 hunks)
state-manager/tests/unit/tasks/test_trigger_cron.py (1 hunks)
state-manager/tests/unit/test_main.py (3 hunks)

🧰 Additional context used

🧬 Code graph analysis (1)

state-manager/tests/unit/tasks/test_trigger_cron.py (3)

state-manager/app/tasks/trigger_cron.py (7)

trigger_cron (97-101)

mark_as_triggered (75-84)

mark_as_failed (37-46)

get_due_triggers (16-27)

call_trigger_graph (29-35)

create_next_triggers (48-73)

handle_trigger (86-95)

state-manager/app/models/db/trigger.py (1)

DatabaseTriggers (9-52)

state-manager/app/models/trigger_models.py (1)

TriggerStatusEnum (9-14)

🔇 Additional comments (3)

docs/docs/exosphere/state-manager-setup.md (1)

32-33: LGTM! Comprehensive documentation of new configuration variables.

The documentation consistently covers TRIGGER_WORKERS and TRIGGER_RETENTION_HOURS across all deployment scenarios (Docker, local development, Docker Compose, and Kubernetes). The defaults are clearly stated (1 worker, 720 hours/30 days retention) and the descriptions are helpful.

Also applies to: 89-90, 158-159, 191-192, 226-228
state-manager/tests/unit/test_main.py (1)
119-120: LGTM! Proper integration of init_tasks mocking.

The tests correctly patch and mock the new init_tasks function that's now called during startup. The patches are properly ordered and the mock parameters are added to all affected test functions.

Consider adding an assertion to verify init_tasks was called during startup, similar to how other initialization functions are verified:
# In test_lifespan_startup_success, after the context manager
mock_init_tasks.assert_awaited_once()
This would ensure the initialization tasks are actually invoked during the startup sequence.

Also applies to: 158-159, 186-187
state-manager/tests/unit/tasks/test_trigger_cron.py (1)

1-290: Excellent test coverage for TTL functionality!

This comprehensive test file provides thorough coverage of the trigger TTL cleanup logic:

Timezone handling: Properly verifies that expires_at is UTC timezone-aware (lines 60-61, 95-96)

Parametrized testing: Efficiently covers multiple scenarios for both mark_as_triggered and mark_as_failed with various retention periods

Error handling: Tests both expected errors (DuplicateKeyError) and unexpected exceptions

Integration flow: Verifies the entire flow from trigger_cron → handle_trigger → individual functions

Settings propagation: Confirms retention_hours parameter is correctly threaded through the call chain (lines 236, 265, 290)

Time accuracy: Uses reasonable tolerance (2 seconds) for time-based assertions

The test on line 279 appropriately uses 24 hours as a test value (not the default) to verify settings propagation works correctly.

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

codecov · 2025-10-05T10:59:33Z

Codecov Report

❌ Patch coverage is 96.66667% with 1 line in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
state-manager/app/tasks/verify_graph.py	75.00%	1 Missing ⚠️

📢 Thoughts on this report? Let us know!

gemini-code-assist

Code Review

This pull request effectively implements a TTL-based cleanup strategy for the DatabaseTriggers collection, which is a great improvement for preventing unbounded data growth. The changes are well-structured, including the new configuration setting, the database model update with a TTL index, and the logic to set the expiration on terminal-state triggers. The accompanying unit tests are comprehensive and cover the new functionality well.

My review includes a critical performance consideration regarding the repeated calls to get_settings() within functions that may be executed in a loop. I've also provided a suggestion to improve the maintainability of the new test code by reducing duplication. Overall, this is a solid contribution.

state-manager/app/tasks/trigger_cron.py

state-manager/tests/unit/tasks/test_trigger_ttl.py

coderabbitai

Actionable comments posted: 3

📜 Review details

Configuration used: CodeRabbit UI

Review profile: ASSERTIVE

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 964d0f2 and c20ec6a.

📒 Files selected for processing (5)

python-sdk/exospherehost/_version.py (1 hunks)
state-manager/app/config/settings.py (2 hunks)
state-manager/app/models/db/trigger.py (2 hunks)
state-manager/app/tasks/trigger_cron.py (3 hunks)
state-manager/tests/unit/tasks/test_trigger_ttl.py (1 hunks)

🧰 Additional context used

🧬 Code graph analysis (3)

state-manager/app/models/db/trigger.py (2)

state-manager/app/config/settings.py (1)

Settings (7-27)

state-manager/app/models/db/graph_template_model.py (1)

Settings (34-42)

state-manager/tests/unit/tasks/test_trigger_ttl.py (3)

state-manager/app/tasks/trigger_cron.py (3)

trigger_cron (97-100)

mark_as_triggered (74-84)

mark_as_failed (37-47)

state-manager/app/models/db/trigger.py (1)

DatabaseTriggers (9-44)

state-manager/app/models/trigger_models.py (1)

TriggerStatusEnum (9-14)

state-manager/app/tasks/trigger_cron.py (3)

state-manager/app/config/settings.py (1)

get_settings (34-38)

state-manager/app/models/db/trigger.py (1)

DatabaseTriggers (9-44)

state-manager/app/models/trigger_models.py (1)

TriggerStatusEnum (9-14)

state-manager/app/tasks/trigger_cron.py

state-manager/tests/unit/tasks/test_trigger_ttl.py

Performance optimization: - Fetch settings once at trigger_cron() level instead of per-trigger - Pass retention_days as parameter to mark functions - Prevents repeated environment variable reads in high-volume scenarios - Eliminates performance degradation when processing triggers in loops Timezone fix (critical): - Use timezone-aware datetime.now(timezone.utc) instead of naive datetime.now() - Prevents shifted expirations on non-UTC hosts - PyMongo treats naive datetimes as UTC, causing incorrect TTL windows - All expires_at timestamps now explicitly UTC for consistency CANCELLED trigger cleanup: - Added mark_as_cancelled() function for future cancellation logic - CANCELLED triggers now expire under same retention policy as TRIGGERED/FAILED - Ensures all terminal states (TRIGGERED, FAILED, CANCELLED) are cleaned up Test improvements: - Refactored using pytest.mark.parametrize to reduce duplication - Added timezone-awareness assertions to verify UTC timestamps - Combined similar tests for all 3 terminal states - Reduced test code from ~120 lines to ~85 lines while increasing coverage - 6 parameterized tests passing (3 states × 2 test scenarios) Signed-off-by: Sparsh <sparsh.raj30@gmail.com>

coderabbitai

Actionable comments posted: 2

📜 Review details

Configuration used: CodeRabbit UI

Review profile: ASSERTIVE

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between c20ec6a and 16434a6.

📒 Files selected for processing (2)

state-manager/app/tasks/trigger_cron.py (3 hunks)
state-manager/tests/unit/tasks/test_trigger_ttl.py (1 hunks)

🧰 Additional context used

🧬 Code graph analysis (2)

state-manager/app/tasks/trigger_cron.py (3)

state-manager/app/models/db/trigger.py (1)

DatabaseTriggers (9-44)

state-manager/app/models/trigger_models.py (1)

TriggerStatusEnum (9-14)

state-manager/app/config/settings.py (1)

get_settings (34-38)

state-manager/tests/unit/tasks/test_trigger_ttl.py (3)

state-manager/app/tasks/trigger_cron.py (4)

trigger_cron (106-110)

mark_as_triggered (84-93)

mark_as_failed (37-46)

mark_as_cancelled (48-57)

state-manager/app/models/db/trigger.py (1)

DatabaseTriggers (9-44)

state-manager/app/models/trigger_models.py (1)

TriggerStatusEnum (9-14)

🔇 Additional comments (6)

state-manager/tests/unit/tasks/test_trigger_ttl.py (2)

1-12: LGTM! Imports and docstring are clear.

The imports correctly include timezone for timezone-aware datetime assertions, and the module docstring clearly describes the test purpose.

14-54: LGTM! Well-structured parametrized test.

The parametrized approach successfully eliminates test duplication (addressing previous review feedback). The test correctly verifies:

The update filter targets the correct trigger ID

Both trigger_status and expires_at are set

The expires_at value is approximately 30 days from now

Timezone-aware UTC datetimes are used (lines 52-53)

The 2-second tolerance is appropriate for test timing variability.

state-manager/app/tasks/trigger_cron.py (4)

1-1: LGTM! Timezone import added.

The timezone import enables timezone-aware datetime creation, addressing the previous review concern about naive datetime usage.

37-46: LGTM! TTL logic correctly implemented.

The function now:

Accepts retention_days as a parameter (avoiding repeated get_settings() calls, addressing previous performance concern)

Uses datetime.now(timezone.utc) for timezone-aware timestamps (ensuring correct TTL behavior across timezones)

Sets both trigger_status and expires_at in a single update

84-93: LGTM! TTL logic correctly implemented.

Consistent with mark_as_failed, this function now accepts retention_days and uses timezone-aware UTC timestamps for correct TTL behavior.

95-110: LGTM! Efficient settings propagation.

The refactored design addresses the previous performance concern by calling get_settings() once at the top level (line 108) and threading retention_days through the call chain. This avoids repeated environment variable reads in the loop.

state-manager/app/tasks/trigger_cron.py

state-manager/tests/unit/tasks/test_trigger_ttl.py

python-sdk/exospherehost/_version.py

state-manager/app/config/settings.py

state-manager/app/models/db/trigger.py

state-manager/app/tasks/trigger_cron.py

state-manager/app/models/db/trigger.py

- Add comprehensive docstring to mark_as_cancelled() explaining it's reserved for future cancellation feature implementation - Rename test_trigger_ttl.py to test_trigger_cron.py to match module name - Expand parametrized tests: test all 3 mark functions with 3 retention periods (9 combinations) for consistent coverage - Add 9 new tests covering all trigger_cron.py functions: - get_due_triggers (with/without triggers) - call_trigger_graph - create_next_triggers (success, DuplicateKeyError, other exceptions) - handle_trigger (success and failure paths) - trigger_cron orchestration - Total: 18 comprehensive tests with timezone-aware datetime assertions Signed-off-by: Sparsh <sparsh.raj30@gmail.com>

coderabbitai

Actionable comments posted: 1

📜 Review details

Configuration used: CodeRabbit UI

Review profile: ASSERTIVE

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 193bf32 and 5a9893a.

📒 Files selected for processing (4)

state-manager/app/config/settings.py (2 hunks)
state-manager/app/models/db/trigger.py (2 hunks)
state-manager/app/tasks/trigger_cron.py (3 hunks)
state-manager/tests/unit/tasks/test_trigger_cron.py (1 hunks)

🧰 Additional context used

🧬 Code graph analysis (3)

state-manager/tests/unit/tasks/test_trigger_cron.py (3)

state-manager/app/tasks/trigger_cron.py (7)

trigger_cron (95-99)

mark_as_triggered (73-82)

mark_as_failed (37-46)

get_due_triggers (16-27)

call_trigger_graph (29-35)

create_next_triggers (48-71)

handle_trigger (84-93)

state-manager/app/models/db/trigger.py (1)

DatabaseTriggers (9-53)

state-manager/app/models/trigger_models.py (1)

TriggerStatusEnum (9-14)

state-manager/app/tasks/trigger_cron.py (4)

python-sdk/exospherehost/statemanager.py (1)

trigger (37-91)

state-manager/app/models/db/trigger.py (1)

DatabaseTriggers (9-53)

state-manager/app/models/trigger_models.py (1)

TriggerStatusEnum (9-14)

state-manager/app/config/settings.py (1)

get_settings (34-38)

state-manager/app/models/db/trigger.py (2)

state-manager/app/config/settings.py (1)

Settings (7-27)

state-manager/app/models/trigger_models.py (1)

TriggerStatusEnum (9-14)

state-manager/app/models/db/trigger.py

Signed-off-by: Sparsh <sparsh.raj30@gmail.com>

- Changed the partial filter expression in DatabaseTriggers to exclude PENDING and TRIGGERING statuses. - Introduced an expires_at field in create_crons to set expiration time for triggers based on retention settings, ensuring proper cleanup of triggers after their designated retention period. This enhances the management of trigger states and ensures that only relevant triggers are retained in the database.

- Added "nin" to the list of ignored words in .codespellignorewords. - Ensured consistency by maintaining the existing format of the file.

coderabbitai

Actionable comments posted: 2

📜 Review details

Configuration used: CodeRabbit UI

Review profile: ASSERTIVE

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 63665b8 and 094dd43.

📒 Files selected for processing (3)

.github/.codespellignorewords (1 hunks)
state-manager/app/models/db/trigger.py (2 hunks)
state-manager/app/tasks/verify_graph.py (3 hunks)

🧰 Additional context used

🧬 Code graph analysis (2)

state-manager/app/tasks/verify_graph.py (1)

state-manager/app/config/settings.py (1)

get_settings (34-38)

state-manager/app/models/db/trigger.py (2)

state-manager/app/config/settings.py (1)

Settings (7-27)

state-manager/app/models/trigger_models.py (1)

TriggerStatusEnum (9-14)

🔇 Additional comments (5)

.github/.codespellignorewords (1)

17-18: LGTM!

Adding "nin" to the ignore list is appropriate since it's a MongoDB operator ($nin) used in the TTL index partial filter expression.

state-manager/app/models/db/trigger.py (1)

16-16: LGTM!

The expires_at field definition is correct. The optional type with default None is appropriate for triggers that shouldn't expire (PENDING, TRIGGERING).

state-manager/app/tasks/verify_graph.py (3)

13-14: LGTM! Imports are correctly added.

The new imports support the TTL-based retention functionality for cron triggers.

18-18: Verify that module-level settings initialization is intentional.

The settings are loaded once at module import time. If the TRIGGER_RETENTION_HOURS environment variable changes after the module is imported, the change won't be reflected without restarting the application.

If runtime reconfiguration is desired, consider calling get_settings() within create_crons() instead of at module level.

119-129: Implementation looks correct, pending resolution of retention period issue.

The DatabaseTriggers instantiation properly includes the new expires_at field alongside existing fields. The logic correctly propagates the calculated expiration time to the database trigger record.

Assuming the retention period calculation at line 117 is corrected per the previous comment, this implementation aligns with the PR objectives.

state-manager/app/models/db/trigger.py

state-manager/app/tasks/verify_graph.py

- Changed the default value of trigger_retention_hours from 24 to 720 in settings.py to extend the retention period for triggers, allowing for better management of trigger states.

…r setup - Updated the state manager setup documentation to include new environment variables: TRIGGER_WORKERS (default: 1) and TRIGGER_RETENTION_HOURS (default: 720) for better configuration of trigger management.

- Introduced a new module for initialization tasks that run when the server starts. - Implemented a task to delete old triggers from the DatabaseTriggers collection based on their status. - Updated the main application file to call the new initialization tasks during the lifespan of the FastAPI app, ensuring proper cleanup of outdated triggers at startup.

coderabbitai

Actionable comments posted: 6

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

state-manager/app/config/settings.py (1)
41-42: Remove or replace the module-level settings snapshot with runtime calls to get_settings().

The module-level settings = get_settings() at lines 41-42 creates a stale snapshot at import time, preventing environment variable overrides from tests or process managers. The codebase already demonstrates the better pattern in most locations (e.g., app/main.py:57, app/utils/check_secret.py:12, app/tasks/trigger_cron.py:97), where get_settings() is called at runtime when needed. Additionally, the same problematic pattern exists in app/utils/encrypter.py:7 and app/tasks/verify_graph.py:18, which also lock settings at module import time.
-# Initialize settings
-settings = get_settings() 
+# Avoid module-level snapshots; call get_settings() where needed for dynamic env overrides

♻️ Duplicate comments (1)

state-manager/app/config/settings.py (1)
16-16: Make retention default consistent with docs and add validation.

Unify default to 720 and enforce non-negative hours to prevent immediate TTL deletion when misconfigured.
-    trigger_retention_hours: int = Field(default=24, description="Number of hours to retain completed/failed triggers before cleanup")
+    trigger_retention_hours: int = Field(
+        default=720,  # 30 days
+        ge=1,
+        description="Number of hours to retain completed/failed triggers before cleanup",
+    )

📜 Review details

Configuration used: CodeRabbit UI

Review profile: ASSERTIVE

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 6dad1a7 and 8831b5b.

📒 Files selected for processing (4)

docs/docs/exosphere/state-manager-setup.md (5 hunks)
state-manager/app/config/settings.py (2 hunks)
state-manager/app/main.py (2 hunks)
state-manager/app/tasks/init_tasks.py (1 hunks)

🧰 Additional context used

🧬 Code graph analysis (2)

state-manager/app/tasks/init_tasks.py (2)

state-manager/app/models/db/trigger.py (1)

DatabaseTriggers (9-52)

state-manager/app/models/trigger_models.py (1)

TriggerStatusEnum (9-14)

state-manager/app/main.py (1)

state-manager/app/tasks/init_tasks.py (1)

init_tasks (16-20)

🪛 GitHub Actions: State Manager Unit Tests

state-manager/app/main.py

[error] 66-66: Beanie error: CollectionWasNotInitialized raised during lifespan startup while accessing DatabaseTriggers. Ensure Beanie is initialized before database collection usage. Command: uv run pytest tests/ --cov=app --cov-report=xml --cov-report=term-missing --cov-report=html -v --junitxml=full-pytest-report.xml

🔇 Additional comments (1)

state-manager/app/tasks/init_tasks.py (1)

6-14: No changes required. Review comment is factually incorrect.

The review's claim that get_pymongo_collection() is "not part of Beanie's public API in v2" contradicts Beanie's actual API design. get_pymongo_collection() is the official Beanie v2 method (renamed from get_motor_collection() in v2) to access a model's underlying PyMongo Collection. The current code correctly uses this method. Additionally, get_collection() suggested in the review is a PyMongo Database method, not a Beanie Document method—it does not exist on the DatabaseTriggers class. No CollectionWasNotInitialized errors or CI failures related to this code were found in the codebase.

Likely an incorrect or invalid review comment.

docs/docs/exosphere/state-manager-setup.md

state-manager/app/config/settings.py

state-manager/app/main.py

state-manager/app/tasks/init_tasks.py

- Updated create_next_triggers function to accept retention_hours as a parameter. - Introduced expires_at calculation to set expiration time for triggers based on retention settings. - Ensured proper handling of trigger creation with expiration in the database, improving trigger management.

- Changed the default value of MONGO_DATABASE_NAME from 'exosphere' to 'exosphere-state-manager' in the state manager setup documentation for clarity and accuracy in configuration.

- Changed the default value of trigger_retention_hours in settings.py from 24 to 720 to extend the retention period for triggers. - Updated test cases in test_main.py and test_trigger_cron.py to reflect the new retention_hours parameter in create_next_triggers function, ensuring consistency in trigger management across the application.

gemini-code-assist bot reviewed Oct 5, 2025

View reviewed changes

state-manager/app/tasks/trigger_cron.py Outdated Show resolved Hide resolved

state-manager/tests/unit/tasks/test_trigger_ttl.py Outdated Show resolved Hide resolved

coderabbitai bot added the enhancement New feature or request label Oct 5, 2025

coderabbitai bot reviewed Oct 5, 2025

View reviewed changes

state-manager/app/tasks/trigger_cron.py Outdated Show resolved Hide resolved

state-manager/app/tasks/trigger_cron.py Show resolved Hide resolved

state-manager/tests/unit/tasks/test_trigger_ttl.py Outdated Show resolved Hide resolved

coderabbitai bot reviewed Oct 5, 2025

View reviewed changes

state-manager/app/tasks/trigger_cron.py Outdated Show resolved Hide resolved

state-manager/tests/unit/tasks/test_trigger_ttl.py Outdated Show resolved Hide resolved

coderabbitai bot mentioned this pull request Oct 5, 2025

Wire up mark_as_cancelled function in trigger cancellation flow #465

Open

NiveditJain reviewed Oct 6, 2025

View reviewed changes

state-manager/app/models/db/trigger.py Show resolved Hide resolved

spa-raj force-pushed the add-trigger-cleanup-strategy branch from 193bf32 to 5a9893a Compare October 6, 2025 09:07

spa-raj requested a review from NiveditJain October 6, 2025 09:10

coderabbitai bot added hacktoberfest hacktoberfest2025 labels Oct 6, 2025

coderabbitai bot reviewed Oct 6, 2025

View reviewed changes

state-manager/app/models/db/trigger.py Show resolved Hide resolved

spa-raj and others added 3 commits October 6, 2025 14:57

Removed Cancelled state from partial index

63665b8

Signed-off-by: Sparsh <sparsh.raj30@gmail.com>

chore: update codespell ignore words list

094dd43

- Added "nin" to the list of ignored words in .codespellignorewords. - Ensured consistency by maintaining the existing format of the file.

NiveditJain added the hacktoberfest-accepted label Oct 22, 2025

nin->in

6dad1a7

coderabbitai bot reviewed Oct 22, 2025

View reviewed changes

state-manager/app/models/db/trigger.py Show resolved Hide resolved

state-manager/app/tasks/verify_graph.py Show resolved Hide resolved

coderabbitai bot mentioned this pull request Oct 22, 2025

Implement API endpoint to cancel database triggers #500

Open

NiveditJain added 2 commits October 22, 2025 17:04

fix: update trigger retention hours to 720

021e4d5

- Changed the default value of trigger_retention_hours from 24 to 720 in settings.py to extend the retention period for triggers, allowing for better management of trigger states.

NiveditJain approved these changes Oct 22, 2025

View reviewed changes

coderabbitai bot added the documentation Improvements or additions to documentation label Oct 22, 2025

coderabbitai bot reviewed Oct 22, 2025

View reviewed changes

NiveditJain added 3 commits October 22, 2025 19:49

docs: update MongoDB database name in state manager setup

8af1c5d

- Changed the default value of MONGO_DATABASE_NAME from 'exosphere' to 'exosphere-state-manager' in the state manager setup documentation for clarity and accuracy in configuration.

NiveditJain merged commit a2c06a0 into exospherehost:main Oct 22, 2025
8 checks passed

This was referenced Oct 23, 2025

feat: add timezone support for CRON triggers (#438) #463

Open

Enhancement/cancel triggers #505

Open

Brijesh-Thakkar mentioned this pull request Dec 10, 2025

fix: wire mark_as_cancelled into trigger cancellation flow & replace destructive deletes with TTL-based cleanup #590

Open

feat: add TTL-based cleanup strategy for DatabaseTriggers collection #464

feat: add TTL-based cleanup strategy for DatabaseTriggers collection #464

Uh oh!

Conversation

spa-raj commented Oct 5, 2025

Uh oh!

safedep bot commented Oct 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

SafeDep Report Summary

Uh oh!

gemini-code-assist bot commented Oct 5, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

coderabbitai bot commented Oct 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Other AI code review bot(s) detected

Summary by CodeRabbit

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Suggested reviewers

Poem

Pre-merge checks and finishing touches

Uh oh!

codecov bot commented Oct 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

safedep bot commented Oct 5, 2025 •

edited

Loading

coderabbitai bot commented Oct 5, 2025 •

edited

Loading

codecov bot commented Oct 5, 2025 •

edited

Loading